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RAW SEQUENCE LISTING 
ERROR REPORT . 




The Biotechnology Systems Branch of the Scieqtific and Technical Information 
■ ■ V Center (STIC) detccited em« whi^ following computer readable 

form: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4Z12. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308^216. 
PATENTIN 2.1 e-mail help: Datin21help@uspto.gOV or phone 703-306-41 19 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help(auspto.gov or phone 703-306-41 19 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS. PLEASE USE THE CHECKER 
VERSION 3a PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDIIESS: 
http;//www,uspto.gov/web/ofIices/pac/checker 

Applicants submitting genetic sequence infomiation electronically on diskette or CD-Rom should be aware thai there is 

a possibility-that^e disk/CD-Rom may have been affected by treatment given to all inconiing mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom, including: 

1. EFS-Bio (<http://www,uspto,gov/ebc/cfs/downloads/documents,htm^ EPS Submission 

User Manual - cPAVE) 

2. U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: u, • 

U.S. Patent and Trademark OfTice. Technology Center 1 600, Reception Area, 7 Floor. Examiner Name. 
Sequence Infonnation, Ciystal Mall One. 191 1 South Clark Street, Arlington, VA 22202 
Or 

' U.S. Patent and Trademark OfTicc, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
201 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark OfTice, 
Box Sequence, Room 1B03-Mai!room. Ciystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 



Date Processed by STIC: t 



Application Serial Number: / o/ 0/S\ 



Source: 
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lUw Sequence LbCinc Error Summary 



ERROR bETECTED SUTCEgTED OORRECnON SERIAL NUMBER: . 

ATTN: Jew rules CASES: PLEASE DISREGARD ENGLISH -ALPHA" HEADERS, WIIIOI WERE INSERTED DY PTO SOFTWAF 



.Wnpped Nudciei TYw numbcrActf it Ihe end efcAch line *Svnppcd^ down to the ncU line. This may eeeur if your file 

Wrapped Aminos %vms rdricvcd in a «vord processor afUr creating it Please a^utf your riglil margin lo .3; this will 
prcvcitt '\inppiAg.'' 

- a I nvilid Line Length The rules require that a line nor cccecd 72 charadcn in iengvK. this includes vrhtte sfuces. 




Misaligned Amino 
Numbering 

Non-ASCII 



^Viriable Length 



6 Patentin 2.0 

-bug" 



7 Skipped Scctucnces 

(OLD RULES) 



8j Skippcil Sequences' 

' (NEW RULES) 



Use ofn't or Xaa'i 



10 



II 



(NEW RULES) 



_Invi|id<2n> 
Response 



Useor<220> 



t2 



Palcntin 2.0 
" ••bug- 



The numbcsing under each 5* amino add is misatigncdL Do not use lab codes between numbers; 
use apace characfcn. instead. — — 

The Bubmined file vms not saved in ASCn(PpS) text, as rcqulrrd by the Scquenee- Rules. Please 
cfbure y»»r aribaeqiien^ 

Sequence(i) ^contain n's or Xaa's rcpreicnting more iluK one residue. Per Sequence Rules, 

each B or Xaa e«n only rrprcacnt a single rnlduB..'PleaM present the maximum number ofeadi 
residue having varisbic length and indicate in the <2^0>-<223> section that some may be missing. 

A "bug" in Patentin version 2.0 has cfM^ed Oft <220>-<223>scdion lo be missing from amino acid 
S6qucnccs(sj . Normally, Psteilt^ 'would^ automatically, giesientc this section from the 

previously coded nudetc sdd sequence. Please manually copy the relevaiit <230>-C323> section to 
ihc subsequent amino add sequence. This applies lo the mandatory O20>-<223> aecllons for 
Arlindal or Unknown aeqocncei. 

Scqucncc(s) ^ missing. Ifinlenlionsl, please insert the follo.wing lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO;X: (insert SEQ ID NO wAere "X" is shown) 
(i) sequence CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(Ki) SEQUENCE DESCRJPnON:SEQ ID NO:X: (insert SEQ ID NO where •'X" is shown) 
This sequence is intentionally skjpped 

Please also adjust the "(ii) NUMBER OF SE(jUENCES:" response lo Include the skipped sequences. 



Sequcnce(s) _ 



missing. I f In ten! Jo naJ. please insert the following i'lnes Tor each skipped scqijenec. 



<1 10> sequence id number 
C400> sequence id number 
000 

Use ofn's and/or Xsa's have been detected in the Sequence Listing. 

Per 1.S23 ofScquenee Rules, use or<220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> lo <223> section, please explain location ofn or Xaa. and which residue n or Xaa represents. 

Per 1.823 or Sequence- Rules, the only valid <2I3> responses are: Unknown. ArtiFicial Sequence, or 
scientific name (Gcnus/spcdcs). <220>*<223> section is required when <2 13> response is Unknown or 
is Aftflidll Sequence 



Sequence(s) . 



_ missing the <220> **Fcilurc" and associitcd numeric identifiers and responses. 



Use or<220> 10 <223> is MANDATORY if <2I3> •'Organism** response is "Artifirial Scquenee" or 
"Unknown." Please explain source of genetic malerisi in <220> to <223> section. 
(See **Fcderal Regi0er,'*O6A>l/1998, VoL 63. No. 104, pp. 29631-32} (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentin version 3.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Indead. please use "File Manager** or any other manual means lo copy file lo floppy disk. 



1 3 - * M isuse of^in \ n can only be used to represent a single nuclcoClde in a nucleic acid sequence. N is not used lo represent 

any value not •pedneally a nudeoiide. 



AMCVMH - Biotechnology Sytfcms Branch - 08/21/2001 
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RAW SEQUENCE LISTING DATE: 01/03/2002 

PATENT APPLICATION: US/10/015,989 TIME: 15:41:56 ^ 



Input Set : A:\ES.tzt 

Output Set: Nc\CRF3\0103200AJ0159a9.raw 




APPLICANT : . DARROW , ANDREW 
QI, JENSON 

ANDRADE- GORDON, PATRICIA 

TITLE OF INVENTION: ZYMOGEN ACTIVATION SYSTElM .. 
FILE REFERENCE: ORT-1552 

CURRENT APPLICATION NUMBER: US/10/015,989 • ^ 

CURRENT FILING DATE: 2001-12-10 " ^ 

NUMBER OF SEQ ID NOS : 60 DoeS NOt COmpIV -j^/'^^- 

SOFTWARE: PATENTIN VER. 2 0 f^m.w^^^^ r%!^i„^ . . ^ P ' 




2 ° Corrected Diskette Needed 



ERRORED SEQUENCES 



52 <210> SEQ ID NO: 1 ' / . ^ fi^ ^ . Q,,^ ^ 

LENGTH: 361 M >^ MUMtC^ ^^Ma^ 



55 <211> LENGTH: 361 

58 <212> TYPE: DNA 

61 <213> ORGANISM: ARTIFICIAL SEQUENCE 

67 <220> FEATURE: 



gtggtgtcaa atictactctt: gtgccaggg^ atgqtctc cq act:a '^gif g^a — ^ /'Z^ <^ (j \ -fl 

cgacgacgac 120 . Jl/iA^tTU \ >C/<-*^ J~ 



7Q <223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQIJENCE:. FUSION GENE 
73"' ■ VECTORis',""/ . •- 
79 <400> SEQUENCE: 1 . 
E--> 82 gaattcacca ccatggacag caaaggttcg tcgcagaaat cccgcctgct ^^^^^yiV7/t^JL>7 I nil 

83 cctgctgctg 60^ . 1 ^ yjfi'Um^l f /J^LA^ 

E--> 86 

87 

E--> 90 gtggacgcgg ccgctcttgc tgcccccttt gatgatgatg acaagatcgt 
91 tgggggctat ISO 

94 gctctagata gcggccgctt ccctttagtg agggttaatg cttcgagcag 

95 acatgataag 240 
E*-> 98 atacattgat gagtttggac aaaccacaac tagaatgcag tgaaaaaaat 

99 gctttatttg 300 

E--> 102 tgaaatttgt gatgctattg ctttatttgt aaccattata agctgcaata ^ 

103 aacaagttga 360 

106 c 361 
112 <210> SEQ ID NO: 2 
115 <211> LENGTH: 301 
118 <212> TYPE: DNA 

121 <213> ORGANISM: ARTIFICIAL SEQUENCE 
127. <220> FEATURE: 

130 <223> OTHER INFORMATION: DESCRIPTION OP ARTIFICIAL SEQUENCE: FUSION GENE 
133 VECTORS. 
139 <400> SEQUENCE: 2 




E--> 



142 gaattcacca tgaatccact cctgatcctt acctttgtgg cggccgctct 0 I qjxjsjaJ 

14 3 tgctgccccc 60 ^^y^^^^^''^^ x'V^^T^'^^^ 



E--> 14 6 tttgatgatg atgacaagat cgttgggggc tattgtctag atacccctac «^ (J 

147 gatgtgcccg 120 
E--> 150 attacgccta gcggccgctt ccctttagtg agggttaatg cttcgagcag 



file://C:\Crf3\Outholdv.VsrJ0 1 5989-htm 
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RAW SEQUENCE LISTING DATE: 01/03/2002 

PATENT APPLICATION: US/10/015,989 TIME: 15:41:56 

Input Set : A:\ES.txt 

output set; N:\CRF3\01032002\J015989.raw 

151 acatgataag 180 
E--> 154 atacattgat gagtttggac aaaccacaac tagaa^gcag ^gaaaaaaat 

155 gctttatttg 240 
E--> 158 tgaaatttgt gatgctattg ctttatttgt aaccattata agctgcaata 

159 aacaagttga 300 

162 c " 
168 <210> SEQ ID NO: 3 

-171 <211> : LENGTH:. 484 -:- " . ■ / . •-•.r^-: - • ■ " ^r-.' .v-C .-■-o:;:-:-.- 
174 <212> TYPE: DNA 

177 <213> ORGANISM: ARTIFICIAL SEQUENCE 

183 <220> FEATURE: " 

186 <223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE: FUSION GENE 
189 VECTORS. 
195 <400> SEQUENCE; 3 



E--> 
E--> 
E--> 
E — > 
E--> 
E--> 



199 cctgctgctg 60 

202 gtggtgtcaa atct 

203 cgacgacgac 120 

206 gtggacgcgg ccgc 

207 gggctcggat 180 

210 ctagataccc ctac 

211 gtgcccgatt 240 

214 acgccgctag atac 

215 tacgatgtgc 300 



caaaggttcg 


tcgcagaaat 


cccgcctgct 


gtgccagggt 


gtggtctccg 


actacaagga 


tgcccccttt 


atcgaggggc 


gcattgtgga 


cccgattacg 


ccgctagata 


cccctacgat 


gatgtgcccg 


attacgccgc 


tagatacccc 


cttcccttta 


gtgagggtta 


atgcttcgag 


gacaaaccac 


aactagaatg 


cagtgaaaaa 


ttgctttatt 


tgtaaccatt 


ataagctgca 



484 



219 cagacatgat 360 
E--> 222 aagatacatt gatgagtl 

223 aatgctttat 420 
E--> 226 ttgtgaaatt tgtgatg< 

227 ataaacaagt 480 

230 tgac 

236 <210> SEQ ID NO: 4 

239 <211> LENGTH: 382 

242 <212> TYPE: DNA 

24 5 <213> ORGANISM: ARTIFICIAL SEQUENCE 
251 <220> FEATURE: 

254 <223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE: FUSION GENE 
257 VECTORS. 
26 3 <4 00> SEQUENCE: 4 
E--> 266 gaattcacca ccatggacag caaaggttcg tcgcagaaat cccgcctgct 

267 cctgctgctg 60 
E--> 270 gtggtgtcaa atctactctt gtgccagggt gtggtctccg actacaagga 
271 cgacgacgac 120 



E--> 
E--> 



271 cgacgacgac 120 a M^^i-^ 

21 A gtggacgcgg ccgctcttgc tgcccccttt gatgatgatg acaagatcgt /J^ 

275 tgggggctac 180 . ^ 

278 aactgtctag acatcaccat caccatcact agcggccgct tccctttagt 

279 gagggttaat 240 

282 gcttcgagca gacatgataa gatacattga tgagtttgga caaaccacaa 

283 ctagaatgca 300 



file://C:\Crf3\Outhold\VsrJ015989.htm 
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RAW SEQUENCE LISTING DATE: 01/03/2002 

PATENT APPLICATION: US/IO/015, 989 TIME: 15:41:56 



Input Set : A:\ES.tzt 

Output Set: N:\CRr3\01032002\J015989-raw 



E--> 286 g^gaaaaaaa tgctttattt gtigaaattit^g tgatgctatt gctttatttg 
287 taaccattat 360 

290 aagctgcaat aaacaagttg ac . 382 

296 <210> SEQ ID NO: 5 
299 <211> LENGTH: 352 

302 <212> TYPE: DNA - " 
305 <213> ORGANISM: ARTIFICIAL SEQUENCE 

.-c311 .<2:20> -FEATURE^:. •■• ■ ..^i^^v-^n^; ■■ ■ - .■ ' -^-v- H.--. " 

314 <223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE: FUSION GENE 
317 VECTORS. 

323 <400> SEQUENCE: 5 - y 

E--> 326 gaattcacca ccatggcttt cctctggctc ctctcctgct gggccctcct 

327 gggtaccacc 60 
E--> 330 ttcggct-gcg gggtccccga ctiacaaggac gacgacgacg cggccgctct 

331 tgctgccccc 120 
E--> 334 tttgatgatg atgacaagat: cgttgggggc tatgctctag acatcaccat 

335 caccatcact 180 
E--> 338 agcggccgct. tccctttagt gagggttaat gcttcgagca gaca^gatiaa 

339 gatacattga 240 
E--> 342 tgagtttgga caaaccacaa ctagaatgca gtgaaaaaaa tgctttat:t:t 

343 gtgaaatttg 300 
E--> 34 6 tgatgctatt gctttatttg taaccattat; aagctgcaat aaacaagttg 

347 ac 352 
_ : , -^353 <2.i0> SilQ. ID NO- .6.-. ■ • .,. v.-.^.v ... • c;. -. .... .. ■ ^i. .. : 

356 <2li> LENGTH: 385 

359 <212> TYPE: DNA 

362 <213> ORGANISM; ARTIFICIAL SEQUENCE 
368 <220> FEATURE: 

371 <223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE: FUSION GENE 

374 VECTORS. 

380 <400> SEQUENCE: 6 
E--> 383 gaattcacca ccatggcttt cctctggctc ctctcctgct gggccctcct 

384 gggtaccacc 60 
E--> 387 ttcggctgcg gggtccccga ctacaaggac gacgacgacg cggccgctct 

388 tgctgccccc 120 
E--> 391 tttgatgatg atgacaagat cgttgggggc tatgctctag atacccctac 

392 gatgtgcccg 180 
£-^> 395 attacgccgc tagacatcac catcaccatc actagcggcc gcttcccttt 

396 agtgagggtt 24 0 
E--> 399 aatgcttcga gcagacatga taagatacat tgatgagttt ggacaaacca 

400 caactagaat 300 
E--> 4 03 gcagtgaaaa aaatgcttta tttgtgaaat ttgtgatgct attgctttat 

404 ttgtaaccat 360 

407 tataagctgc aataaacaag ttgac 385 
413 <210> SEQ ID NO: 7 
416 <211> LENGTH: 1169 
419 <212> TYPE: DNA 

422 <213> ORGANISM: ARTIFICIAL SEQUENCE 
428 <220> FEATURE: 




file://C:\Crf3\Ou tholcKVsrJO 1 5989.htm 



1/3/02 



08/a9/04 MON 11:52 FAX 703 308 4221 PTO/STIC 



RAW SEQUENCE LISTING DATE: 01/03/2002 

PATENT ARRtlCATrON: US/10/015,989 TIME: 15:41:56 ... 

^ • 
Input Set : A:\ES.txt 

Output Set: N:\CRF3\01032002\J015989-raw 

431 <223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE: FUSION T3ENE 
4 34 WITH HOMO iSAPIEN SERINE PROTEASE CATALYTIC DOMAIN 

4 40 <400> SEQUENCE: 7 
E--> 44 3 gaattcacca ccatggacag caaaggttcg tcgcagaaat cccgcctgct 
444 cctgctgctg 60 

E--> 44 7 gtggtgtcaa atctactctt. gtgccagggt gtggtctccg actacaagga — ^ - 

448 cgacgacgac 120 

. ...7:E-:^> 451 gtggacgcgg ccgctcttgcv tgceccclrtt- gatgatgatg acaag^ "-"-- rj : 

4 52 tgggggctat 180 
E«-> 4 55 gctctagagg ccggtcagtg gccctggcag gtcagcatca cctatgaagg 

4 56 cgtccatgtg 240 ' V 

E--> 4 59 tgtggtggct ctctcgtgtc tgagcagtgg gtgctgtcag ctgctcactg 

4 60 cttccccagc 300 
E--> 4 63 gagcaccaca aggaagccta tgaggtcaag ctgggggccc accagctaga 

4 64 ctcctactcc 360 
£.-> 4 67 gaggacgcca aggtcagcac cctgaaggac atcatccccc accccagcta 

4 68 cctccaggag 420 
E--> 4 71 ggctcccagg gcgacattigc actcctccaa ctcagcagac ccatcacctt 

4 72 ctcccgctac 480 
E--> 4 75 atccggccca tctgcctccc tigcagccaac gcctccttcc ccaacggcct 

4 76 ccactgcact 540 
E--> 4 79 gtcactggct ggggtcatgt ggccccctca gtgagcctcc tgacgcccaa 

4 80 gccactgcag 600 
E-T> 483 caactcgagg :.tgcctctgat cagtcgtgag acgtgtaact gcctgtacaa 

484 catcgacgcc 660 
E--> 4 87 aagcctgagg agccgcactt tgtccaagag gacatggtgt gtgctggcta 

488 tgtggagggg 720 
E--> 4 91 ggcaaggacg cctgccaggg tgactctggg ggcccactct cctgccctgt 

4 92 ggagggtctc 780 
E--> 4 95 tggtacctga cgggcattgt gagctgggga gatgcctgtg gggcccgcaa 

496 caggcctggt 840 
E--> 4 99 gtgtacactc tggcctccag ctatgcctcc tggatccaaa gcaaggtgac 

500 agaactccag 900 
E--> 503 cctcgtgtgg tgccccaaac ccaggagtcc cagcccgaca gcaacctctg 

504 tggcagccac 960 
E--> 5 07 ctggccttca gctctagaca tcaccatcac catcactagc ggccgcttcc 

508 ctttagtgag 1020 
E--> 511 ggttaatgct tcgagcagac atgataagat acattgatga gtttggacaa 

512 accacaacta 1080 
E--> 515 gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga tgctattgct 

516 ttatttgtaa 1140 

519 ccattataag ctgcaataaa caagttgac 1169 
525 <210> SEQ ID NO: 8 
528 <211> LENGTH: 114 2 
531 <212> TYPE: DNA 

534 <213> ORGANISM: ARTIFICIAL SEQUENCE 
540 <220> FEATURE: 

54 3 <223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE: FUSION GENE 
54 6 WITH HOMO SAPIEN SERINE PROTEASE CATALYTIC DOMAIN 



@|007 
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RAW SEQUENCE 1,1 STING 

PATENT APPLICATION: US/10/015, 989 



DATE: 01/03/2002 
TIME: 15:41:56 







552 


E- 


-> 


555 






556 


E- 


-> 


559 






560 


E- 


-> 


563 






564 




^> 


567 






568 


E- 


-> 


571 






572 


E- 


-> 


575 






576 


E- 


-> 


579 






580 


E- 


-> 


583 






584 


E- 


-> 


587 






588 


E- 


-> 


591 






592 


E- 


-> 


595 






596 




r> 


599 






600 


E- 


-> 


603 






604 


E- 


-> 


607 






608 


E- 


-> 


611 






612 


E- 


-> 


615 






616 


E- 


-> 


619 






620 


E- 


-> 


623 






624 


E- 


-> 


627 






628 






631 






637 






640 






643 






646 






652 






655 






658 






664 


E- 


-> 


667 



Input Set : 
Output Set: 



A:\ES.txt 

N5\CRr3\01032002\JOl5989-raw 



<400> SEQUENCE: 8 



gggtaccacc 60 

ttcggctgcg gggtccccga ctacaaggac gacgacgacg cggccgctct 
tgctgccccc 120 

tttgatgatg atgacaagat cgttgggggc tatgctctag aggccggtca 
gtggccctgg 180 

caggtcagca tcacc^t^ aggcgtccat. gtg^tgtggtg 'gctcts^tegt 
gtctgagcag 24 0 

tgggtgctgt cagctgctca ctgcttcccc agcgagcacc acaaggaagc 
ctatgaggtc 300 V 
aagctggggg cccaccagct agactcctac tccgaggacg ccaaggtcag 

caccctgaag 360 

gacatcatCG cccaccccag ctacctccag gagggctccc agggcgacat 
tgcactcctc .420 

caactcagca gacccatcac cttctcccgc tacatccggc ccatctgcct 

ccctgcagcc 480 



tcagtgagcc tcctgacgcc caagccactg cagcaactcg aggiigcctct 

gatcagtcgt 600 

gagacgtgta actgcctgta caacatcgac gccaagcctg aggagccgca 
ctttgtccaa 660 

gaggacatgg tgtgtgctgg ctatgtggag gggggcaagg acgcrctgcca 
gggtgactct 720 

gggggcccac tctcctgccc tgtggagggt ctctggtacc tgacgggcat 

tgtgagctgg 780 

ggagatgcct gtggggcccg caacaggcct ggtgtgtaca ctctggcctc 

cagctatgcc 840 

tcctggatcc aaagcaaggt gacagaactc cagcctcgtg tggtgcccca 

aacccaggag 900 

tcccagcccg acagcaacct ctgtggcagc cacctggcct tcagctctag 
acatcaccat 960 

caccatcact agcggccgct tccctttagt gagggttaat gcttcgagca 
gacatgataa 1020 

gatacattga tgagtttgga caaaccacaa ctagaatgca gtgaaaaaaa 

tgctttattt 1080 

gtgaaatttg tgatgctatt gctttatttg taaccattat aagctgcaat 
aaacaagttg 1140 
ac 

<210> SEQ ID NO: 9 
<211> LENGTH: 104 9 
<2l2> TYPE: DNA 

<213> ORGANISM: ARTIFICIAL SEQUENCE 
<220> FEATURE: 

<223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE: 

WITH HOMO SAPIEN SERINE PROTEASE CATALYTIC DOMAIN 
<400> SEQUENCE: 9 
667 gaattcacca ccatggacag caaaggttcg tcgcagaaat cccgcctgct 




1142 



FUSION GENE 
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668 cctgctgctg 60 . 
E--> 671 gtggtgtcaa atctactctt gtgccagggt gtggtctccg actecaagga 

672 cgacgacgac 120 
E--> 675 gtggacgcgg ccgctcttgc tgcccccttt gatgatgatg acaagatcgt 

676 tgggggctac 180 
E--> 679 aactgtctag aaccccattc gcagccttgg caggcggcct tgttccaggg 

680 ccagcaacta 240 
E-:-> 683 ctctgtggcg gtgtccttgt aggtggcaac tgggtcctta cagctgccca • 

684 ctgtaaaaaa 300 
E--> 687 ccgaaataca cagtacgcct gggagaccac agcctacaga at:aaagatgg 

688 cccagagcaa 3 60 
E--> 691 gaaatacctg tggttcagtc catcccacac ccctgctaca acagcagcga 

692 tgtggaggac 4 20 
E--> 695 cacaaccatg atctgatgct tcttcaactg cgtgaccagg catccctggg 

696 gtccaaagtg 480 
E--> 699 aagcccatca gcctggcaga tcattgcacc cagcctggcc agaagtgcac 

700 cgtctcaggc 540 
E--> 703 tggggcactg tcaccagtcc ccgagagaat tttcctgaca ctctcaactg 

704 tgcagaagta 600 
E--> 707 aaaatctttc cccagaagaa gtgtigaggat gct:tacccgg ggcagatcac 

708 agatggcatg 660 
E--> 711 gtctgtgcag gcagcagcaa aggggctgac acgtgccagg gcgattctgg 

712 aggccccctg 720 
..Er-> 715 gtgtgtgatg gtgcactcca gggcatcaca tcctggggct cagacccctg 

716 tgggaggtcc 780 
E-,> 719 gacaaacctg gcgtctatac caacatctigc cgctacctgg actggatcaa 

720 gaagatcata 840 
E--> 723 ggcagcaagrg gctctagaca tcaccatcac calicactagc ggccgcttcc 

724 ctttagtgag 900 
E--> 727 ggttaatgct tcgagcagac atgataagat acaftgatga gtttggacaa 

728 accacaacta 960 
E--> 731 gaatgcagtg aaaaaaatgc tttatttgtg aaatttgtga tgctattgct 

732 ttatttgtaa 1020 

7 35 ccattataag ctgcaataaa caagttgac 1049 
741 <210> SEQ ID NO: 10 
744 <211> LENGTH: 1052 
747 <212> TYPE: DNA 

7 50 <213> ORGANISM: ARTIFICIAL SEQUENCE 
756 <220> FEATURE: 

759 <223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE: FUSION GENE 

7 62 WITH HOMO SAPIEN SERINE PROTEASE CATALYTIC DOMAIN 

768 <400> SEQUENCE: 10 
E--> 771 gaattcacca ccatggacag caaaggttcg tcgcagaaat cccgcctgct 

772 cctgctgctg 60 
E--> 77 5 gtggtgtcaa atctactctt gtgccagggt gtggtctccg actacaagga 

776 cgacgacgac 120 
E-.> gtggacgcgg ccgctcttgc tgcccccttt gatgatgatg acaagatcgt 

780 tgggggctac 180 
E--> 783 aactgtctag aaaagcactc ccagccctgg caggcagccc tgttcgagaa 
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784 












E- 


-> 


787 




cgacgcuca^ 


cgcccccaga 




caocaoccca 






788 


ctgcctcaag 


inn 








E- 


- > 


791 


ccccgcuaca 


wagubcaccu 


ggggcagcaw 




aooaooaooo 






TOT 

792 


ctgt^agcag 


K;n 








E- 


- > 


795 


acccgcfacag 


ccactgagtc 


^ #v s 


r* ^ CTW^ ^ a 


acaacacTCCt 






79o 


CCCCaaCaaa 


A on 












o o 


a r^^n^A- 


a wgaCa^coLw 


^ CTfT^ fVH St CT 


' a^cTCTcatccrc 


cagtc tc c at 






onn 


caccugggcu 


Ann 








E- 


- > 


fi n o 
BO J 


gtgcgacccc 




Cvcacgc V^lw 


ertc a c to c to 


ocaccaocto 






804 




^A n 








E- 


- > 


ovl 


ggctggggca 


gcacguccag 


ccwccag 


WW W*J WW WW 


acaccttgcg 






808 


atgcgccaac 


f A A 

600 








E- 


- > 


811 


atcaccatca 


ttgagcacca 


gaagtgtgag 


a,a^U w w t WW 


ccooca aca. t 






812 


cacagacacc 


<; n 








E- 


- ^ 


a 1 4 


a ugg ug ug l.^ 




gg ct a 9 g c| «| «| w 


aaggactcct 


gccagggtga 






816 


ctccgggggc 


720 








E- 


-> 


819 


cctctggtct 


gtaaccagtc 


tcttcaaggc 


attatctcct 


ggggccagga 






820 


tccgtgtgcg 


780 








E- 


-> 


823 


atcacccgaa 


agcctggtgt 


ctacacgaaa 


gtctgcaaat 


atgrtggactg 






824 


gatccaggag 


840 








E- 


-> 


827 


acgatgaaga 


acaattctag 


acatcaccat 


caccatcact 


agcggccgct 






828 


tccctttagt 


900 • 








Er 


-:> 


831 


gagggttaat 


gcttcgagca 


gacatgataa 


gatacattga 


tgagtttgga 






832 


caaaccacaa 


960 








E- 


-> 


835 


ctagaatgca 


gtgaaaaaaa 


tgctttattt 


gtgaaatttg 


tgatgctatt 






836 


gctttatttg 


1020 












839 


taaccattat 


aagctgcaat 


aaacaagttg 


ac gSj 




^^^^ _ 1052 

1067 <210> SEQ ID NO: 12 
1070 <211> LENGTH: 319 
1073 <212> TYPE: PRT 

1076 <213> ORGANISM: ARTIFICIAL SEQUENCE 

1082 <220> FEATURE: 

1085 <2 23> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE: FUSION GENE 
1088 WITH HOMO SAPIEN SERINE PROTEASE CATALYTIC DOMAIN 

1094 <400> SEQDENrF- 10- 

1097 ^gT ALA PHE LEU TRP LEU LEU SER CyS TRP ALA LEU LEU GLY 

1100^:^^ !> TXJ " ^ 351. ^7 

11061 

1109 — . 

1115 LEU ALA ALA PRO PHE ASP ASP ASP ASP LYS ILE VAL GLY GLY TYR ALA*^ 
1118 35 40 45 

1124 LEU GLU ALA GLY GLN TRP PRO TRP GLN VAL SER ILE THR TYR GLU GLY-^ 
1127 50 55 60 ^ 

1133 VAL HIS VAL CYS GLY GLY 



3QUE 

GLY CYS GLY VAL PRO ASP T YR LYS A SP ASP ASP ASP AL A ALA ^A L^ Jf/^ A II ) 

^ 20"- \ 25 JB ^ -JuMJ^ ^ ^' ^ 

^jTt^ 

SER LEU VAL SER GLU GLN TRP VAL LEU SER^ ^J^^jJ^^^/^^^ A 

1136 65 70 75 80 ^ / *j 

1142 ALA ALA HIS CYS PHE PRO SER GLU HIS HIS LYS GLU ALA TYR GLU VAL ^Lr>-^^ 
1145 85 90 95 ^ 

1151 LYS LEU GLY ALA HIS GLN LEU ASP SER TYR SER GLU ASP ALA LYS VAL AJ^f^^"^""^^^^^ 
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E--> 
E--> 

Er-> 



1154 








lUU 










1 

xuo 










Tin 
xxu 






1160 


SER 


THR 


LEU 


LYS 


ASP 


ILE 


ILE 


PRO 


UT6 

HIS 


rRU 


CPS 


1 XK 


T l?ft 
XjCtU 




IT 


tlT V 
wXiX 


1163 






lie 




















X^ 3 








11d9 


SLR 


ULN 


uiil 


R C D 


ILE 


ALA 


T PTT 


T PIT 










It IV Li 


TT.P 


TYIP 


PHP 


11 / ^ 




1 in 




















X *i V 










11 /o 


SER 


AKU 


rp vrs 
i IK 


TT TP 
ILL 


ARG 


PRO 


ILE 




T IPTT 




^ T H 
AIjA 




Aon 


ATA 




erne* 


llol 


143- 










150 










1 

X D 










X w w 


1187 


PIIO 


ASN 


GLY 


LEU 


HI:S.^YS 


THR 


VAL 


THR 


GLY 


TRP. 


GLY 


HIS 


VAL 


ALA 


PRO 


1 1 on 
ll^U 










165 










1 7A 










175 




1 1 0£ 

jLiy o 


ObK 


VAJj 






LEU 


THR 


Don 


T VC 










T.PfT 


CI J. XI 


VAL 


PRO 










1 on 




















X 7 v 








r i?f f 


TT IP 


a£tK 


A or* 


GLU 


THR 


r*vc 






T.Pfl 


X xn 




TT F 


n0 It 


ALA 


LYS 








±73 




















205 










irKU 


UJjU 




FKU 


HIS 


PHE 


V£\Lt 








MPT 


VAL 


CYS 


ALA 


GLY 


TYR 


1217 




21U 










ZlD 










Ton 










1223 


VAL 


GLU 


GLY 


GLY 


LYS 


ASP 


ALA 


CYS 


GLN 


GLY 


ASP 


SER 


GLY 


GLY 


PRO 


LEU 


1226 


225 










230 










235 










240 


1232 


SER 


CYS 


PRO 


VAL 


GLU 


GLY 


LEU 


TRP 


TYR 


LEU 


THR 


GLY 


ILE 


VAL 


SER 


TRP 


1235 










245 










250 










255 




1241 
1242 


GLY 


ASP 


ALA 


CYS 


GLY 


ALA 


ARG 


ASN 


ARG 


PRO 


GLY 


VAL 


TYR 


THR 


LElJrv^ 


ALA. 
































1245 








260 










265 










270 






1251 


SER 


SER 


TYR 


ALA 


SER 


TRP 


ILE 


GLN 


SER 


LYS 


VAL 


THR 


GLU 


LEU 


GLN 


PRO 


1254 






275 










280 










285 








1260 


ARG 


VAL 


VAL 


PRO 


GLN 


THR 


GLN 


GLU 


SER 


GLN 


PRO 


ASP 


SER 


ASN 


LEU 


CYS 


1263 




290 










295 










300 










1269 


GLY 


SER 


HIS 


LEU 


ALA 


PHE 


SER 


SER 


ARG 


HIS 


HIS 


HIS 


HIS 


HIS 


HIS 





310 



315 



35 



1272 305 

2409 <210> SEQ ID NO: 
2412 <211> LENGTH: 55 
2415 <212> TYPE: DNA 

2418 <213> ORGANISM: ARTIFICIAL SEQUENCE 
2424 <220> FEATURE: 

2427 <223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE: 
2430 OLIGONUCLEOTIDE 

2436 <400> SEQUENCE: 35 ^ 
24 39 aattcaccaG_catggctttc ctctggctcc tctcctgctg ggccctcctg ^ ^/7^v/v/ yO^'iJ^ 



2440<ggtac 



2446 <2lUi^ SKU II) NO: 36 
2449 <211> LENGTH: 47 
2452 <212> TYPE: DNA 

24 55 <213> ORGANISM: ARTIFICIAL SEQUENCE 
2461 <220> FEATURE: 

2464 <223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE: 
24 67 OLIGONUCLEOTIDE 
2473 <400> SEQUENCE: 36 

24 76 ccaggagggc ccagcaggag aggagccaga ggaaagccat ggtggtg 

2477 47 

2483 <210> SEQ ID NO: 37 



cfh Cwv/ 

JU 
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2486 <211> LENGTH: 45 
2489 <212> TYPE: DNA 

2492 <213> ORGANISM: ARTIFICIAL SEQUENCE 
2498 <220> FEATURE: 

2501 <223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE: 
2504 OLIGONUCLEOTIDE 
2510 <400> SEQUENCE: 37 
E-_V 2513 caccttcggc tgcggggtcc ccgactacaa ggacgacgac gacgc 
2514 45 
2520 <210> SEQ ID NO: 38 

2523 <211> LENGTH: 53 ' V 

2526 <212> TYPE: DNA 

2529 <213> ORGANISM: ARTIFICIAL SEQUENCE 
2535 <220> FEATURE: 

2538 <223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE: 
.2541 OLIGONUCLEOTIDE 
2547 <400> SEQUENCE: 38 jLn^^^^ 
E--> 2550 ggccgcgtcg tcgtcgtcct tgtagtcggg gaccccgcag ccgaaggtgg 

2551 tac 53 
2629 <210> SEQ ID NO: 41 
2632 <211> LENGTH: 55 
2635 <212> TYPE:. DNA 

2638 <213> ORGANISM: ARTIFICIAL SEQUENCE 
2644 <220> FEATURE: . . 

2647 <223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE: 
2650 - OLIGONUCLEOTIDE 
2656 <400> SEQUENCE: 41 
E--> 2659 ggccgctctt gctgccccct ttgatgatga tgacaagatc gttgggggct ^^^^-/v'^ 

2660 atgct 55 . ^ 

2666 <210> SEQ ID NO: 42 
2669 <211> LENGTH: 55 
2672 <212> TYPE: DNA 

2675 <213> ORGANISM: ARTIFICIAL SEQUENCE 
2681 <220> FEATURE: 

2684 <223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE: 
2687 OLIGONUCLEOTIDE 

2693 <400> SEQUENCE: 42 _^r>^ 
E--> 2696 ctagagcata gcccccaacg atcttgtcat catcatcaaa gggggcagca Z)^^^'^''^^^'^ 
2697 agagc 55 . 

2703 <210> SEQ ID NO: 43 
2706 <211> LENGTH: 55 
2709 <212> TYPE: DNA 

2712 <213> ORGANISM: ARTIFICIAL SEQUENCE 
2718 <220> FEATURE: 

2721 <223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE: 
2724 OLIGONUCLEOTIDE 
2730 <400> SEQUENCE: 43 
E--> 27 33 ggccgctctt gctgccccct ttgatgatga tgacaagatc gttgggggct /\yLf/^^ 



2734 attgt 55 
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2740 
2743 
2746 
2749 

2755 
2758 
2761 
2767 

E--> 2770 
2771 
2777 
2780 
2783 
2786 
2792 
2795 
2798 
2804 

E--> 2807 
2808 
2814 
2817 
2820 
2823 
2829 
2832 
2835 
2841 

E--> 2844 
2845 
3280 
3283 
3286 
3289 
3295 
3298 
3301 
3307 



Input Set : 
Output. Set: 



44 



A:\ES.txt 

N:\CRF3\01032002\J0l5989.raw 



<210> SEQ ID NO 
<211> LENGTH: 55 
<212> TYPE: DNA 

<213> ORGANISM: ARTIFICIAL SEQUENCE 

<220> FEATURE: 

<223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE 

OLIGONUCLEOTIDE 
<400> SEQUENCE: 44 

ctagacaata gcccccaacg atcttgtcat cat.catcaaa gggggcagca 




agagc 



55 



<210> SEQ ID NO: 45 
<211> LENGTH: 52 
<212> TYPE: DNA 

<213> ORGANISM: ARTIFICIAL SEQUENCE 

<220> FEATURE: 

<223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE 

OLIGONUCLEOTIDE 
<400> SEQUENCE: 4 5 

ggccgctctt gctgccccct ttatcgaggg gcgcattgtg gagggctcgg 

at 52 
<210> SEQ ID NO: 46 
<211> LENGTH: 52 
<212> TYPE: DNA 

<213> ORGANISM: ARTIFICIAL SEQUENCE 
<220> FEATURE: 

<223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE 

OLIGONUCLEOTIDE 
<400> SEQUENCE: 46 

ctagatccga gccctccaca atgcgcccct cgataaaggg ggcagcaaga 
gc 52 
<210> SEQ ID NO: 54 
<211> LENGTH: 284 
<212> TYPE; PRT 

<213> ORGANISM: ARTIFICIAL SEQUENCE 
<220> FEATURE: 

<223> OTHER INFORMATION: DESCRIPTION OF ARTIFICIAL SEQUENCE 

PROTEASE IN PPEK ZYMOGEN VECTOR 
<400> SEQUENCE: 54 





HUMAN MH2 



331D 


MET 


ASP 


SER 


LYS 


GLY 


SER 


SER 


GLN 


LYS 


SER 


ARG 


LEU 


LEU 


LEU 


LEU 


LEU 


3313 


1 








5 










10 










15 




3319 


VAL 


VAL 


SER 


ASN 


LEU 


LEU 


LEU 


CYS 


GLN 


GLY 


VAL 


VAL 


SER 


ASP 


TYR 


LYS 


3322 








20 










25 










30 






3328 


ASP 


ASP 


ASP 


ASP 


VAL 


ASP 


ALA 


ALA 


ALA 


LEU 


ALA 


ALA 


PRO 


PHE 


ASP 


ASP 


3331 






35 










40 










45 








3337 


ASP 


ASP 


LYS 


ILE 


VAL 


GLY 


GLY 


TYR 


ASN 


CYS 


LEU 


GLU 


PRO 


HIS 


SER 


GLN 


3340 




50 










55 










60 










3346 


PRO 


TRP 


GLN 


ALA 


ALA 


LEU 


VAL 


MET 


GLU 


ASN 


GLU 


LEU 


PHE 


CYS 


SER 


GLY 


3349 


65 










70 










75 










80 


3355 


VAL 


LEU 


VAL 


HIS 


PRO 


GLN 


TRP 


VAL 


LEU 


SER 


ALA 


ALA 


HIS 


CYS 


PHE 


GLN 
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E--> 
E--> 
E--> 
E--> 
E--> 
E-T> 
E--> 



E--> 
E--> 
E--> 
E--> 
E--> 
E--> 
E--> 



3358 










85 










90 










95 




3364 


ASN 


SER 


TYR 


THR 


ILE 


GLY 


LEU 


^GLY 


LEU 


HIS 


SER 


LEU 


GLU 


ALA 


ASP 


GLN 


3367 








100 










105 










110 






3373 


GLU 


PRO 


GLY 


SER 


GLN 


MET 


VAL 


GLU 


ALA 


SER 


LEU 


SER 


VAL 


ARG 


HIS 


PRO 


3376 






115 










120 










125 








3382 


GLU 


TYR 


ASN 


ARG 


PRO 


LEU 


LEU 


ALA 


ASN 


ASP 


LEU 


MET 


LEU 


ILE 


LYS 


LEU " 


3385 




130 










135 










140 










3391 


ASP. 


GLO; 


SER 


VAL 


SER 


GLU 


SER 


ASP 


THR 


ILE 


ARG 


SER 


ILE 


SER 


ILE 


ALA 


3394 


145 










150 










155 










160 


3400 


SER 


GLN 


CYS 


PRO 


THR 


ALA 


GLY 


ASN 


SER 


CYS 


LEU 


VAL 


SER 


GLY 


TRP 


GLY 


3403 










165 










170 










175 




3409 
3410j 
3413 




LEU 


ALA 


ASN 


GLY 


ARG 


MET 


PRO 


THR 


VAL 


LEU 


GLN 


CYS 


VAL 


ASN ^ /TT^ 






180 










185 










190 






3419 


SER 


VAL 


VAL 


SER 


GLU 


GLU 


VAL 


CYS 


SER 


LYS 


LEU 


TYR 


ASP 


PRO 


LEU 


TYR 


3422 






195 










200 










205 








3428 


HIS 


PRO 


SER 


HET 


PHE 


CYS 


ALA 


GLY 


GLY 


GLY 


HIS 


ASP 


GLN 


LYS 


ASP 


SER 


3431 




210 










215 










220 










3437 


CYS 


ASN 


GLY 


ASP 


SER 


GLY 


GLY 


PRO 


LEU 


ILE 


CYS 


ASN 


GLY 


TYR 


LEU 


GLN 


3440 


225 










230 










235 










240 


3446 


GLY. 


LEU 


VAL 


SER 


PHE 


GLY 


LYS 


ALA 


PRO 


CYS 


GLY 


GLN 


VAL 


GLY 


VAL 


PRO 


3449 










245 










250 










255 




3455 


GLY 


VAL 


TYR 


THR 


ASN 


LEU 


CYS 


LYS 


PHE 


THR 


GLU 


TRP 


ILE 


GLU 


LYS 


THR 


34 58 








260 










265 










270 






3464 


VAL 


GLN 


ALA 


SER 


SER 


ARG 


HIS 


HIS 


HIS 


HIS 


HIS 


HIS 











3467 



280 



275 

3608 <210> SEQ ID NO: 59 
3611 <211> LENGTH: 1103 
3614 <212> TYPE: DNA 

3617 <213> ORGANISM: ARTIFICIAL SEQUENCE 
3623 <220> FEATURE: 

<223> OTHER INFORMATION: DESCRIPTION OF ARTI 
SEQUENCE OF HUMAN PROTEASE F IN CFEK2 
VECTOR 
3638 <400> SEQUENCE: 59 

364 1 gaattc^rc;a rMtggcttt cctctggctc ctctcctgct 
2(^rtgtaccacc_^60^ ~ 



3626 
3629 
3632 



364: 



FICIAL SEQUENCE: NUCLEIC ACID 
ZYMOGEN 



gggccctcct 



3 64 5 ttcggctgcg gggtccccga ctacaaggac gacgacgacg cggccgctct 

3 646 tgctgccccc 120 

3 64 9 tttgatgatg atgacaagat cgttgggggc tatgctctag aactcgggcg 

3650 ttggccgtgg 180 

3653 caggggagcc tgcgcctgtg ggattcccac gtatgcggag tgagcctgct 

3654 cagccaccgc 240 

3657 tgggcactca . cggcggcgca ctgctttgaa acctatagtg accttagtga 

3658 tccctccggg 300 

3661 tggatggtcc agtttggcca gctgacttcc atgccatcct tctggagcct 

3662 gcaggcctac 360 

3665 tacaaccgtt acttcgtatc gaatatctat ctgagccctc gctacctggg 

3666 gaattcaccc 420 




4 
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RAW SEQUENCE LISTING DATE: 01/03/2002 

PATENT APPLICATION: US/10/015^ 989 TIME: 15:41:56 

Input Set : As\ES.tzt 

Output Set: N:\CRr3\01032002\J015989.raw 

E--> 3669 tatgacattg ccttggtgaa gctgtctgca cctgtcacct acactaaaca 

3670 catccagccc 480 " 
E--> 3673 atctgtctcc aggcctccac atttgagttt gagaaccgga cagactgctg 

3674 ggtgactggc 540. 
E--> 3677 tgggggtaca tcaaagagga tgaggcactg ccatctccce acaccctcca 

3678 ggaagttcag 600 . 
E--> 3681 gtcgccatca taaacaactc tatgtgcaac cacctcttcc tcaagtacag 

3682 tttpcgcaag 660 
E--> 368 5 gacatctttg gagacatggt ttgtgctggc aatgcccaag gcgggaagga 

3686 tgcctgcttc 720 
E--> 3689 ggtgactcag gtggaccctt ggcctgtaac aagaatggac tgtggt^4tca 

3690 gattggagtc 780 
E--> 3693 gtgagctggg gagtgggetg tggtcggccc aatcggcccg gtgtctacac 

3694 caatatcagc 840 
E--> 3697 caccactttg agtggatcca gaagctgatg gcccagagtg gcatgtccca 

3698 gccagacccc 900 
E--> 3701 tcctggtcta gacatcacca tcaccatcac tagcggccgc ttccctttag 

3702 tgagggttaa 960 
E--> 3705 tgcttcgagc agacatgata agatacattg atgagtttgg acaaaccaca 

3706 actagaatgc 1020 
E--> 3709 agtgaaaaaa atgctttatt tgtgaaattt gtgatgctat tgctttattt 

3710 gtaaccatta 1080 

3713 taagctgcaa taaacaagtt gac 1103 




..i-i.-MiinioM is ijiesoiucii i» ilie <210> to <22i> heldsot 
jiicli sequence using a or Xaa. 
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VERIFICATION SUMMARY DATE: 01/03/2002 

PATENT APPLICATION: US/10/015, 989 TIME: 15:41:57 

Input Set : A:\ES.tzt 

Output Set: N:\CRr3\01032002\J0159a9.raw 

L:31 M:270 C: Current Application Number differs. Replaced Application Number 

L:34 M:271 C: Current Filing Date differs. Replaced Current Filing Date 

L:82 M:254 E: No. of Bases conflict, LENGTH: Input : 0 Counted:50 SEQ:1 

L:82 M:112 C: (48) String data converted to lower case, 



M 


1 1 o 


Kepea uea 


-i n 


OC(jWU"X 














M . 


254 


Repeated 


in 


CA#«M<-k«1 

oeqNO""! 














Li . 




M ■ p • 

n . zjh cj-. 


wo . 




\^\Jli X X i> f 


LENGTH • 
xj£i» w X n ■ 


Tnnut'O Counted* 


50 


SEQ: 


2 


M : 


112 


Repeated 


Jin 


begNO^^ 














M ■ 


2!>4 


Repeated 


4 n 


oegNO"-^ 














T • 
Ij 


ion 




viLJ . 




^WtlX X Xw ^ f 


LENGTH • 


Xnout'O Counted: 


50 


SEQ: 


3 


M * 


Liz 


Kepea L.ea 


xn 


jseqwo" J 














M : 


2!)4 


KepeaLea 


j.n 


oecj[NO'^ J 














T . 

Li . 


D 


M OS A P - 

n m ^ JH dm 


Nn 




f'f^n f 1 ict - 

X X «t- \^ f 


LENGTH : 


Input :0 Counted: 


50 


SEQ: 


4 


M 




Kepcai,eQ 


xn 
















M 


n C il 

234 


Repeated 


in 


oecjNo=** 














T 

L . 


J 2 0 


M . Z34 £i . 


riO . 


OX oases 


OviiiX XXww f 


LENGTH • 


TnDut'O Counted* 


50 


SEQ: 


5 


M : 


111 


Kepeaueu 


in 


oeCjNO— 3 














M 


254 


Repeated 


in 


SeqNo-5 














T 
Li 


383 


M:254 E: 


No. 


of Bases 


conflict, 


LENGTH : 


Input :0 Counted: 


50 


SEQ: 


6 


M 


112 


Repeated 


in 


SeqNo=6 














M 


254 


Repeated 


in 


SeqNo=*6 














L- 


443 


M:254 E: 


No. 


of Bases 


conflict. 


LENGTH : 


Input :0 Counted: 


50 


SEQ: 


7 


M 


112 


Repeated 


in 


ScqNo=7 














M' 


254 


Repeated 


in 


SeqNo«7 














L, 


S55 


M: 254 E: 


No. 


of Bases 


conflict. 


LENGTH: 


Input :0 Counted: 


50 


SEQ: 


8 


M. 


112 


Repeated 


in 


SeqNo=8 














M: 


254 


Repeated 


in 


SeqNO"8 














L. 


667 


M:254 E: 


No. 


of Bases 


conflict, 


LENGTH : 


input :0 Counted: 


50 


SEQ: 9 


M: 


112 


Repeated 


in 


SeqNo=9 














M: 


254 


Repeated 


in 


SeqNo«9 














L: 


771 


M:254 E: 


No. 


of Bases 


conflict. 


LENGTH : 


Input :0 Counted: 


50 


SEQ: 


10 


M: 


112 


Repeated 


in 


SeqNo-10 














M: 


254 


Repeated 


in 


SeqNo=10 














L: 


1242 M:332 E 


( 32 ) Invalid/Missing 


Amino Acid Numbering, SEQ 


10:12 


M- 


332 


Repeated 


in 


SeqNo=12 















L: 


1719 


M: 


112 


C 


(48) 


String 


data 


converted 


to 


lower 


case. 


L: 


1755 


M: 


112 


C 


' (48) 


String 


data 


converted 


to 


lower 


case. 


L: 


1791 


M: 


112 


C 


(48) 


String 


data 


converted 


to 


lower 


case. 


L: 


1827 


M: 


112 


C 


(48) 


String 


data 


converted 


to 


lower 


case. 


L: 


1863 


M: 


112 


C 


(48) 


String 


data 


converted 


to 


lower 


case, 


L: 


1899 


M: 


112 


C 


(48) 


String 


data 


converted 


to 


lower 


case. 


L: 


1935 


M: 


112 


C 


(48) 


String 


data 


converted 


to 


lower 


case, 


L: 


1971 


M: 


112 


C 


. (48) 


String 


data 


converted 


to 


lower 


case, 


L: 


2007 


M: 


112 


C 


(48) 


String 


data 


converted 


to 


lower 


case. 


L: 


2043 


M: 


112 


C 


(48) 


String 


data 


converted 


to 


lower 


case. 


L: 


2079 


M: 


112 


C 


(48) 


String 


data 


converted 


to 


lower 


case. 


L: 


2115 


M: 


112 


C 


(48) 


String 


data 


converted 


to 


lower 


case. 


L: 


2151 


M: 


112 


C 


: (48) 


String 


data 


converted 


to 


lower 


case, 
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VERIFICATION SUMMAHY 

PATENT APPLICATION: US/10/015,989 



DATE: 01/03/2002 
TIME: 15:41: 57 



L:2187 M:112 C 
L:2223 M:112 C 
L:2259 M:112 C 
L:2295 M:112 C 
L:2331 M:112 C 
L:2367 M:112 C 
L:2403 M:112 C 
L:2439 M:254 E 
M:112 Repeated 
L:2476 M:254 E: 
M:112 Repeated 
L:25.13 M:254 E: 
M:112 Repeated 
L:2550 M:254 E: 
M:112 Repeated 
L:2587 M:112 C: 
L:2623 M:112 C: 
L:2659 M:254 E: 
M:112 Repeated 
L:2696 M:254 E: 
M:112 Repeated 
L:2733 M:254 E: 
M:112 Repeated 
L:2770 M:254 E: 
M:112 Repeated 
L:2807 M:254 E: 
M:112 Repeated 
L:2844 M:254 E: 
M:112 Repeated 
L:2881 M:112 C 
L:2917 M:112 C 
L:2953 M:112 C 
L:2989 M:112 C 
L:3025 M:112 C 
L:3061 M:112 C 
L: 3410 M: 332 E 
M:332 Repeated 
L:3503 M:112 C 
L:3641 M:254 E 
M:254 Repeated 
L:3752 M:254 E 
M:254 Repeated 



Input Set : A:\ES.^t: 

Output Set: Ks\CKF3\01032002\JO 15989 .raw 

(48) String data converted to lower case, 
(48) String data converted to lower case, 
(48.) String data converted to lower case, 
(48) String data converted to lower case, 
(48) String data converted to lower case, 
(48) String data converted to lower case, 
(48) String data converted to lower case. 
No. of Bases conflict, LENGTH: Input :0 Counted: 50 SEQ:35 
in SeqNo=35 

No. of Bases conflict, LENGTH : Input : 0 Counted:47 i5EQ;36 
in SeqNo-36 ' 

No. of Bases conflict, LENGTH: Input: 0 Counted: 45 SEQ:37 
in SeqNo-37 

NO. of Bases conflict, LENGTH: Input :0 Counted:50 SEQ:38 
in SeqNo=38 

(4 8) String data converted to lower case, 

(48) String data converted to lower case, 

No. of Bases conflict, LENGTH : Input : 0 Counted: 50 SEQ:41 
in SeqNo=41 

LENGTH: Input: 



No. of Bases . conflict, 
in SeqNo=42 

No. of Bases Conflict, 
in SeqNo*'4 3 

No. of Bases conflict, 
in SeqNo=44 

No. of Bases conflict, 
in SeqNo=4 5 
No. of Bases conflict, 



Counted: 50 SEQ:42 
Counted: 50 SEQ:4 3 



:0 Counted: 50 SEQ:44 



LENGTH: Input: 
LENGTH: Input: 
LENGTH: Input :0 Counted: 50 SEQ:45 
LENGTH: Input: 0 Counted: 50 SEQ:46 



in SeqNo=46 

(48) String data converted to lower case, 

(48) String data converted to lower case, 

(48) String data converted to lower case, 

(48) String data converted to lower case, 

(48) String data converted to lower case, 

(48) String data converted to lower case, 

(32) Invalid/Missing Amino Acid Numbering, SEQ ID: 54 
in SeqNo=54 

(48) String data converted to lower case, 

No. of Bases conflict, LENGTH: Input :0 Counted: 50 SEQ: 59 
in SeqNo=59 

NO. of Bases conflict, LENGTH: Input :0 Counted: 50 SEQ: 60 
in SeqNo-60 
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Defective images within this document are accurate representations of the original 
documents submitted by the appUcant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SffiES 

□ FADED TEXT OR DRAWING 



□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: . — — 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 
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BLURRED OR ILLEGIBLE TEXT OR DRAWING 



